(D) STATE: D.C.
(viii) ATTORNEY/AGENT INFORMATION:
(ix) TELECOMMUNICATION INFORMATION:
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
CGTTGTTAAG TCAATGGAAA C 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
TCTAGAGAGA AAACGTGAAT GTTGTCTCT 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
GTACGGATCC ATGATGGTTT TCCAGCCAAT CAGTGAG 
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
GTACGGTACC TTATATTGAC AGCGCACAGA GCGGG 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
GTACGGATCC ATATGTGGAA TGTTCATGGC TGGGG 
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
GTACGGATCC GAATTCATTT GCAAATGGTG 
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GTACGGTACC TGATCAATGA AGACGTTATA G 31
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GTACGGATCC TGATCAGGAT TTTTCTGGTG 30 
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GTACGGTACC TGATCAAAAA ATATAACCGC 30 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GTACGGATCC TGATCAAACC AAGGCCAGCA TTAC 34
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GTACGGTACC TTATTCTACA CAAACCGCAT AG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GTACGGATCC ACTGAAAGCA AGCGGTGGTG ATG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GTACGGATCC TTCATGGTAT TCAGAAAATA C 
(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GTACGGATCC GACTGTCGAT GCATCAGGGA AAG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
GTACGGATCC GAATGGTAAA GGCAGTGTCG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GTACGGTACC TCCAGAACGC TGCTCACTAG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
GTACGGTACC TTATTCTACA GAAACCGCAT AG 32 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATAACATGAG TACTCATGGT TG 22 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Ile Thr His Gly Cys Tyr Thr Arg Thr Arg His Lys His Lys Leu
1 5 10 15

Lys Lys Thr Leu Ile Met Leu Ser Ala Gly Leu Gly Leu Phe Phe Tyr
20 25 30

Val Asn Gln Asn Ser Phe Ala Asn Gly Glu Asn Tyr Phe Lys Leu Gly
35 40 45

Ser Asp Ser Lys Leu Leu Thr His Asp Ser Tyr Gln Asn Arg Leu Phe
50 55 60

Tyr Thr Leu Lys Thr Gly Glu Thr Val Ala Asp Leu Ser Lys Ser Gln
65 70 75 80

Asp Ile Asn Leu Ser Thr Ile Trp Ser Leu Asn Lys His Leu Tyr Ser
85 90 95

Ser Glu Ser Glu Met Met Lys Ala Ala Pro Gly Gln Gln Ile Ile Leu
100 105 110

Pro Leu Lys Lys Leu Pro Phe Glu Tyr Ser Ala Leu Pro Leu Leu Gly
115 120 125

Ser Ala Pro Leu Val Ala Ala Gly Gly Val Ala Gly His Thr Asn Lys
130 135 140

Leu Thr Lys Met Ser Pro Asp Val Thr Lys Ser Asn Met Thr Asp Asp
145 150 155 160

Lys Ala Leu Asn Tyr Ala Ala Gln Gln Ala Ala Ser Leu Gly Ser Gln
165 170 175

Leu Gln Ser Arg Ser Leu Asn Gly Asp Tyr Ala Lys Asp Thr Ala Leu
180 185 190

Gly Ile Ala Gly Asn Gln Ala Ser Ser Gln Leu Gln Ala Trp Leu Gln
195 200 205

His Tyr Gly Thr Ala Glu Val Asn Leu Gln Ser Gly Asp Asn Phe Asp
210 215 220

Gly Ser Ser Leu Asp Phe Leu Leu Pro Phe Tyr Asp Ser Glu Lys Met
225 230 235 240

Leu Ala Phe Gly Gln Val Gly Ala Arg Tyr Ile Asp Ser Arg Phe Thr
245 250 255

Ala Asn Leu Gly Ala Gly Gln Arg Phe Phe Leu Pro Ala Asn Met Leu
260 265 270

Gly Tyr Asn Val Phe Ile Asp Gln Asp Phe Ser Gly Asp Asn Thr Arg
275 280 285

Leu Gly Ile Gly Gly Glu Tyr Trp Arg Asp Tyr Phe Lys Ser Ser Val
290 295 300

Asn Gly Tyr Phe Arg Met Arg Arg Trp His Glu Ser Tyr His Lys Lys
305 310 315 320

Asp Tyr Asp Glu Arg Pro Ala Asn Gly Phe Asp Ile Arg Phe Asn Gly
325 330 335

Tyr Leu Pro Ser Tyr Pro Ala Leu Gly Ala Lys Leu Ile Tyr Glu Gln
340 345 350

Tyr Tyr Gly Asp Asn Val Ala Leu Phe Asn Ser Asp Lys Leu Gln Ser
355 360 365

Asn Pro Gly Ala Ala Thr Val Gly Val Asn Tyr Thr Pro Ile Pro Leu
370 375 380

Val Thr Met Gly Ile Asp Tyr Arg His Gly Thr Gly Asn Glu Asn Asp
385 390 395 400

Leu Leu Tyr Ser Met Gln Phe Arg Tyr Gln Phe Asp Lys Ser Trp Ser
405 410 415

Gln Gln Ile Glu Pro Gln Tyr Val Asn Glu Leu Arg Thr Leu Ser Gly
420 425 430

Ser Arg Tyr Asp Leu Val Gln Arg Asn Asn Asn Ile Ile Leu Glu Tyr
435 440 445

Lys Lys Gln Asp Ile Leu Ser Leu Asn Ile Pro His Asp Ile Asn Gly
450 455 460

Thr Glu His Ser Thr Gln Lys Ile Gln Leu Ile Val Lys Ser Lys Tyr
465 470 475 480

Gly Leu Asp Arg Ile Val Trp Asp Asp Ser Ala Leu Arg Ser Gln Gly
485 490 495

Gly Gln Ile Gln His Ser Gly Ser Gln Ser Ala Gln Asp Tyr Gln Ala
500 505 510

Ile Leu Pro Ala Tyr Val Gln Gly Gly Ser Asn Ile Tyr Lys Val Thr
515 520 525

Ala Arg Ala Tyr Asp Arg Asn Gly Asn Ser Ser Asn Asn Val Gln Leu
530 535 540

Thr Ile Thr Val Leu Ser Asn Gly Gln Val Val Asp Gln Val Gly Val
545 550 555 560

Thr Asp Phe Thr Ala Asp Lys Thr Ser Ala Lys Ala Asp Asn Ala Asp
565 570 575

Thr Ile Thr Tyr Thr Ala Thr Val Lys Lys Asn Gly Val Ala Gln Ala
580 585 590

Asn Val Pro Val Ser Phe Asn Ile Val Ser Gly Thr Ala Thr Leu Gly
595 600 605

Ala Asn Ser Ala Lys Thr Asp Ala Asn Gly Lys Ala Thr Val Thr Leu
610 615 620

Lys Ser Ser Thr Pro Gly Gln Val Val Val Ser Ala Lys Thr Ala Glu
625 630 635 640



Met Ser Ser Ala Leu Asn Ala Ser Ala Val Ile Phe Phe Asp Gln Thr
645 650 655

Lys Ala Ser Ile Thr Glu Ile Lys Ala Asp Lys Thr Thr Ala Val Ala
660 665 670

Asn Gly Lys Asp Ala Ile Lys Tyr Thr Val Lys Val Met Lys Asn Gly
675 680 685

Gln Pro Val Asn Asn Gln Ser Val Thr Phe Ser Thr Asn Phe Gly Met
690 695 700

Phe Asn Gly Lys Ser Gln Thr Gln Ala Thr Thr Gly Asn Asp Gly Arg
705 710 715 720

Ala Thr Ile Thr Leu Thr Ser Ser Ser Ala Gly Lys Ala Thr Val Ser
725 730 735

Ala Thr Val Ser Asp Gly Ala Glu Val Lys Ala Thr Glu Val Thr Phe
740 745 750

Phe Asp Glu Leu Lys Ile Asp Asn Lys Val Asp Ile Ile Gly Asn Asn
755 760 765

Val Arg Gly Glu Leu Pro Asn Ile Trp Leu Gln Tyr Gly Gln Phe Lys
770 775 780

Leu Lys Ala Ser Gly Gly Asp Gly Thr Tyr Ser Trp Tyr Ser Glu Asn
785 790 795 800

Thr Ser Ile Ala Thr Val Asp Ala Ser Gly Lys Val Thr Leu Asn Gly
805 810 815

Lys Gly Ser Val Val Ile Lys Ala Thr Ser Gly Asp Lys Gln Thr Val
820 825 830

Ser Tyr Thr Ile Lys Ala Pro Ser Tyr Met Ile Lys Val Asp Lys Gln
835 840 845

Ala Tyr Tyr Ala Asp Ala Met Ser Ile Cys Lys Asn Leu Leu Pro Ser
850 855 860

Thr Gln Thr Val Leu Ser Asp Ile Tyr Asp Ser Trp Gly Ala Ala Asn
865 870 875 880

Lys Tyr Ser His Tyr Ser Ser Met Asn Ser Ile Thr Ala Trp Ile Lys
885 890 895

Gln Thr Ser Ser Glu Gln Arg Ser Gly Val Ser Ser Thr Tyr Asn Leu
900 905 910

Ile Thr Gln Asn Pro Leu Pro Gly Val Asn Val Asn Thr Pro Asn Val
915 920 925

Tyr Ala Val Cys Val Glu
930

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

TCGAGAATGA AATAGAAGTC GTTGTTAAGT CAATGGAAAA CCTGTATTTG GTATTACATA 60

ATCAGGGAAT AACATTAGAA AACGAACATA TGAAAATAGA GGAAATCAGT TCAAGCGACA 120

ATAAACATTA TTACGCCGGA AGATAAAATC CGATCTATTA ATATAATTTA TTTCTCATTC 180 

TAACTCATTG TGGTGGAGCC ATAACATGAT TACTCATGGT TGTTATACCC GGACCCGGCA 240

CAAGCATAAG CTAAAAAAAA CATTGATTAT GCTTAGTGCT GGTTTAGGAT TGTTTTTTTA 300 

TGTTAATCAG AATTCATTTG CAAATGGTGA AAATTATTTT AAATTGGGTT CGGATTCAAA 360 

ACTGTTAACT CATGATAGCT ATCAGAATCG CCTTTTTTAT ACGTTGAAAA CTGGTGAAAC 420 

TGTTGCCGAT CTTTCTAAAT CGCAAGATAT TAATTTATCG ACGATTTGGT CGTTGAATAA 480

GCATTTATAC AGTTCTGAAA GCGAAATGAT GAAGGCCGCG CCTGGTCAGC AGATCATTTT 540

GCCACTCAAA AAACTTCCCT TTGAATACAG TGCACTACCA CTTTTAGGTT CGGCACCTCT 600 

TGTTGCTGCA GGTGGTGTTG CTGGTCACAC GAATAAACTG ACTAAAATGT CCCCGGACGT 660 

GACCAAAAGC AACATGACCG ATGACAAGGC ATTAAATTAT GCGGCACAAC AGGCGGCGAG 720

TCTCGGTAGC CAGCTTCAGT CGCGATCTCT GAACGGCGAT TACGCGAAAG ATACCGCTCT 780

TGGTATCGCT GGTAACCAGG CTTCGTCACA GTTGCAGGCC TGGTTACAAC ATTATGGAAC 840 

GGCAGAGGTT AATCTGCAGA GTGGTAATAA CTTTGACGGT AGTTCACTGG ACTTCTTATT 900 

ACCGTTCTAT GATTCCGAAA AAATGCTGGC ATTTGGTCAG GTCGGAGCGC GTTACATTGA 960 

CTCCCGCTTT ACGGCAAATT TAGGTGCGGG TCAGCGTTTT TTCCTTCCTG CAAACATGTT 1020

GGGCTATAAC GTCTTCATTG ATCAGGATTT TTCTGGTGAT AATACCCGTT TAGGTATTGG 1080 

TGGCGAATAC TGGCGAGACT ATTTCAAAAG TAGCGTTAAC GGCTATTTCC GCATGAGCGG 1140 



CTGGCATGAG TCATACAATA AGAAAGACTA TGATGAGCGC CCAGCAAATG GCTTCGATAT 1200

CCGTTTTAAT GGCTATCTAC CGTCATATCC GGCATTAGGC GCCAAGCTGA TATATGAGCA 1260 

GTATTATGGT GATAATGTTG CTTTGTTTAA TTCTGATAAG CTGCAGTCGA ATCCTGGTGC 1320

GGCGACCGTT GGTGTAAACT ATACTCCGAT TCCTCTGGTG ACGATGGGGA TCGATTACCG 1380

TCATGGTACG GGTAATGAAA ATGATCTCCT TTACTCAATG CAGTTCCGTT ATCAGTTTGA 1440 

TAAATCGTGG TCTCAGCAAA TTGAACCACA GTATGTTAAC GAGTTAAGAA CATTATCAGG 1500 

CAGCCGTTAC GATCTGGTTC AGCGTAATAA CAATATTATT CTGGAGTACA AGAAGCAGGA 1560 

TATTCTTTCT CTGAATATTC CGCATGATAT TAATGGTACT GAACACAGTA CGCAGAAGAT 1620 

TCAGTTGATC GTTAAGAGCA AATACGGTCT GGATCGTATC GTCTGGGATG ATAGTGCATT 1680 

ACGCAGTCAG GGCGGTCAGA TTCAGCATAG CGGAAGCCAA AGCGCACAAG ACTACCAGGC 1740 

TATTTTGCCT GCTTATGTGC AAGGTGGCAG CAATATTTAT AAAGTGACGG CTCGCGCCTA 1800 

TGACCGTAAT GGCAATAGCT CTAACAATGT ACAGCTTACT ATTACCGTTC TGTCGAATGG 1860 

TCAAGTTGTC GACCAGGTTG GGGTAACGGA CTTTACGGCG GATAAGACTT CGGCTAAAGC 1920 

GGATAACGCC GATACCATTA CTTATACCGC GACGGTGAAA AAGAATGGGG TAGCTCAGGC 1980 

TAATGTCCCT GTTTCATTTA ATATTGTTTC AGGAACTGCA ACTCTTGGGG CAAATAGTGC 2040

CAAAACGGAT GCTAACGGTA AGGCAACCGT AACGTTGAAG TCGAGTACGC CAGGACAGGT 2100 

CGTCGTGTCT GCTAAAACCG CGGAGATGAC TTCAGCACTT AATGCCAGTG CGGTTATATT 2160 

TTTTGATCAA ACCAAGGCCA GCATTACTGA GATTAAGGCT GATAAGACAA CTGCAGTAGC 2220

AAATGGTAAG GATGCTATTA AATATACTGT AAAAGTTATG AAAAACGGTC AGCCAGTTAA 2280 

TAATCAATCC GTTACATTCT CAACAAACTT TGGGATGTTC AACGGTAAGT CTCAAACGCA 2340

AGCAACCACG GGAAATGATG GTCGTGCGAC GATAACACTA ACTTCCAGTT CCGCCGGTAA 2400 

AGCGACTGTT AGTGCGACAG TCAGTGATGG GGCTGAGGTT AAAGCGACTG AGGTCACTTT 2460 

TTTTGATGAA CTGAAAATTG ACAACAAGGT TGATATTATT GGTAACAATG TCAAGAGGTC 2520

GATGTTGCCT AATATTTGGC TGCAATATGG TCAGTTTAAA CTGAAAGCAA GCGGTGGTGA 2580 

TGGTACATAT TCATGGTATT CAGAAAATAC CAGTATCGCG ACTGTCGATG CATCAGGGAA 2640

AGTCACTTTG AATGGTAAAG GCAGTGTCGT AATTAAAGCC ACATCTGGTG ATAAGCAAAC 2700 

AGTAAGTTAC ACTATAAAAG CACCGTCGTA TATGATAAAA GTGGATAAGC AAGCCTATTA 2760 
TGCTGATGCT ATGTCCATTT GCAAAAATTT ATTACCATCC ACACAGACGG TATTGTCAGA 2820

TATTTATGAC TCATGGGGGG CTGCAAATAA ATATAGCCAT TATAGTTCTA TGAACTCAAT 2880 

AACTGCTTGG ATTAAACAGA CATCTAGTGA GCAGCGTTCT GGAGTATCAA GCACTTATAA 2940

CCTAATAACA CAAAACCCTC TTCCTGGGGT TAATGTTAAT ACTCCAAATG TCTATGCGGT 3000

TTGTGTAGAA TAATTCCATA ACCACCCCGG CTAAAATATG TATTGTTTTA GTCGGGGCAT 3060

AATTATTTCT TCTTAAGAAA TAACCCTCTT ATAATCAAAT CTACTACTGG TCTTTTTATC 3120 

TGCTTAATAG G 3131 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3106 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GGAAAGATAA ATCCGATCTA TTAATATAAT TTATTTCTCA TTCTAACTCA TTGTGGTGGA 60

GCCATAACAT GAGTACTCAT GGTTGTTATA CCCGGACCCG GCACAAGCAT AAGCTAAAAA 120

AAACATTGAT TATGCTTAGT GCTGGTTTAG GATTGTTTTT TTATGTTAAT CAGAATTCAT 180 

TTGCAAATGG TGAAAATTAT TTTAAATTGG GTTCGGATTC AAAACTGTTA ACTCATGATA 240 

GCTATCAGAA TCGCCTTTTT TATACGTTGA AAACTGGTGA AACTGTTGCC GATCTTTCTA 300

AATCGCAAGA TATTAATTTA TCGACGATTT GGTCGTTGAA TAAGCATTTA TACAGTTCTG 360 

AAAGCGAAAT GATGAAGGCC GCGCCTGGTC AGCAGATCAT TTTGCCACTC AAAAAACTTC 420 

CCTTTGAATA CAGTGCACTA CCACTTTTAG GTTCGGCACC TCTTGTTGCT GCAGGTGGTG 480 

TTGCTGGTCA CACGAATAAA CTGACTAAAA TGTCCCCGGA CGTGACCAAA AGCAACATGA 540 

CCGATGACAA GGCATTAAAT TATGCGGCAC AACAGGCGGC GAGTCTCGGT AGCCAGCTTC 600 

AGTCGCGATC TCTGAACGGC GATTACGCGA AAGATACCGC TCTTGGTATC GCTGGTAACC 660 

AGGCTTCGTC ACAGTTGCAG GCCTGGTTAC AACATTATGG AACGGCAGAG GTTAATCTGC 720 

AGAGTGGTGA TAACTTTGAC GGTAGTTCAC TGGACTTCTT ATTACCGTTC TATGATTCCG 780 



AAAAAATGCT GGCATTTGGT CAGGTCGGAG CGCGTTACAT TGACTCCCGC TTTACGGCAA 840

ATTTAGGTGC GGGTCAGCGT TTTTTCCTTC CTGCAAACAT GTTGGGCTAT AACGTCTTCA 900 

TTGATCAGGA TTTTTCTGGT GATAATACCC GTTTAGGTAT TGGTGGCGAA TACTGGCGAG 960 

ACTATTTCAA AAGTAGCGTT AACGGCTATT TCCGCATGAG GCGCTGGCAT GAGTCATACC 1020 

ATAAGAAAGA CTATGATGAG CGCCCAGCAA ATGGCTTCGA TATCCGTTTT AATGGCTATC 1080 

TACCGTCATA TCCGGCATTA GGCGCCAAGC TGATATATGA GCAGTATTAT GGTGATAATG 1140

TTGCTTTGTT TAATTCTGAT AAGCTGCAGT CGAATCCTGG TGCGGCGACC GTTGGTGTAA 1200

ACTATACTCC GATTCCTCTG GTGACGATGG GGATCGATTA CCGTCATGGT ACGGGTAATG 1260

AAAATGATCT CCTTTACTCA ATGCAGTTCC GTTATCAGTT TGATAAATCG TGGTCTCAGC 1320

AAATTGAACC ACAGTATGTT AACGAGTTAA GAACATTATC AGGCAGCCGT TACGATCTGG 1380 

TTCAGCGTAA TAACAATATT ATTCTGGAGT ACAAGAAGCA GGATATTCTT TCTCTGAATA 1440 

TTCCGCATGA TATTAATGGT ACTGAACACA GTACGCAGAA GATTCAGTTG ATCGTTAAGA 1500 

GCAAATACGG TCTGGATCGT ATCGTCTGGG ATGATAGTGC ATTACGCAGT CAGGGCGGTC 1560 

AGATTCAGCA TAGCGGAAGC CAAAGCGCAC AAGACTACCA GGCTATTTTG CCTGCTTATG 1620 

TGCAAGGTGG CAGCAATATT TATAAAGTGA CGGCTCGCGC CTATGACCGT AATGGCAATA 1680

GCTCTAACAA TGTACAGCTT ACTATTACCG TTCTGTCGAA TGGTCAAGTT GTCGACCAGG 1740

TTGGGGTAAC GGACTTTACG GCGGATAAGA CTTCGGCTAA AGCGGATAAC GCCGATACCA 1800 

TTACTTATAC CGCGACGGTG AAAAAGAATG GGGTAGCTCA GGCTAATGTC CCTGTTTCAT 1860

TTAATATTGT TTCAGGAACT GCAACTCTTG GGGCAAATAG TGCCAAAACG GATGCTAACG 1920

GTAAGGCAAC CGTAACGTTG AAGTCGAGTA CGCCAGGACA GGTCGTCGTG TCTGCTAAAA 1980 

CCGCGGAGAT GAGTTCAGCA CTTAATGCCA GTGCGGTTAT ATTTTTTGAT CAAACCAAGG 2040 

CCAGCATTAC TGAGATTAAG GCTGATAAGA CAACTGCAGT AGCAAATGGT AAGGATGCTA 2100 

TTAAATATAC TGTAAAAGTT ATGAAAAACG GTCAGCCAGT TAATAATCAA TCCGTTACAT 2160 

TCTCAACAAA CTTTGGGATG TTCAACGGTA AGTCTCAAAC GCAAGCAACC ACGGGAAATG 2220 

ATGGTCGTGC GACGATAACA CTAACTTCCA GTTCCGCCGG TAAAGCGACT GTTAGTGCGA 2280 

CAGTCAGTGA TGGGGCTGAG GTTAAAGCGA CTGAGGTCAC TTTTTTTGAT GAACTGAAAA 2340 

TTGACAACAA GGTTGATATT ATTGGTAACA ATGTCAGAGG CGAGTTGCCT AATATTTGGC 2400 



TGCAATATGG TCAGTTTAAA CTGAAAGCAA GCGGTGGTGA TGGTACATAT TCATGGTATT 2460

CAGAAAATAC CAGTATCGCG ACTGTCGATG CATCAGGGAA AGTCACTTTG AATGGTAAAG 2520

GCAGTGTCGT AATTAAAGCC ACATCTGGTG ATAAGCAAAC AGTAAGTTAC ACTATAAAAG 2580

CACCGTCGTA TATGATAAAA GTGGATAAGC AAGCCTATTA TGCTGATGCT ATGTCCATTT 2640

GCAAAAATTT ATTACCATCC ACACAGACGG TATTGTCAGA TATTTATGAC TCATGGGGGG 2700

CTGCAAATAA ATATAGCCAT TATAGTTCTA TGAACTCAAT AACTGCTTGG ATTAAACAGA 2760

CATCTAGTGA GCAGCGTTCT GGAGTATCAA GCACTTATAA CCTAATAACA CAAAACCCTC 2820

TTCCTGGGGT TAATGTTAAT ACTCCAAATG TCTATGCGGT TTGTGTAGAA TAATTCCATA 2880

ACCACCCCGG CTAAAATATG TATTGTTTTA GTCGGGGCAT AATTATTTCT TCTTAAGAAA 2940

TAACCTCTTA TAATCAAATC TACTACTGGT CTTTTTATCT GCTTAATAGG TCTCTTTCAA 3000

AGAGACACAT TCACGTTTTC TAGAGTAGGT TGATCCAACC ACGCTGTATA CCAAAGCTGA 3060

ATCACATCAA GCAACAACTA TGCTCACAAC ATCCACACAA TAAAAA 3106



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

ATGAGAGGAT CGCAYCAYCA YCAYCAYCAY GGATCCGCAT GCGACTCGGT ACCCCGGGTC 60 

GACCTGCAGC CAAGCTTAAT TAGCTGAG 88
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



ATGAGAGGAT CTCAYCAYCA YCAYCAYCAY ACGGATCCGC ATGCGAGCTC GGTACCCCGG 60

GTCGACCTGC AGCCAAGCTT AATTAGCTGA G 91



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
ATGAGAGGAT CTCAYCAYCA YCAYCAYCAY GGGATCCGCA TGCGAGCTCG GTACCCCGGG 60 
TCGACCTGCA GCCAAGCTTA ATTAGCTGAG 90 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

CTCGAGAAAT CATAAAAAAT TTATTTGCTT TGTGAGCGGA TAACAATTAT AATAGATTCA 60 

ATTGTGAGCG GATAACAATT TCACACAGAA TTCATTAAAG AGGAGAAATT AACTATGAGA 120

GGATCGCATC ACCATCACCA TCACGGATCC GCATGCGAGC TCGGTACCCC GGGTCGACCT 180 

GCAGCCAAGC TTAATTAGCT GAGCTTGGAC TCCTGTTGAT AGATCCAGTA ATGACCTCAG 240 

AACTCCATCT 250 



